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Description 

[0001] The present invention relates to novel nucleic acid chain extension terminators, their use in nucleic acid se- 
quencing and synthesis, respectively, as well as a method for preparing such compounds. 

5 [0002] Today, there are two predominant methods for DNA sequence determination: the chemical degradation meth- 
od (Maxam and Gilbert, Proc. Natl. Acad. Sci. 74:560-564 (1977), and the dideoxy chain termination method (Sanger 
et al., Proc. Natl. Acad. Sci. 74:5463-5467 (1977)). Most automated sequencers are based on the chain termination 
method utilizing fluorescent detection of product formation. In these systems either primers to which deoxynucleotides 
and dideoxynucleotides are added are dye-labelled, or the added dideoxynucleotides are fluorescently labelled. As an 

10 alternative, dye labelled deoxynucleotides can be used in conjunction with unlabeled dideoxynucleotides. This chain 
termination method is based upon the ability of an enzyme to add specific nucleotides onto the 3' hydroxyl end of a 
primer annealed to a template. The base pairing property of nucleic acids determines the specificity of nucleotide 
addition. The extension products are then separated electrophoretically on a polyacrylamide gel and detected by an 
optical system utilizing laser excitation. 

15 [0003] Although both the chemical degradation method and the dideoxy chain termination method are in widespread 
use, there are many associated disadvantages. For example, the methods require gel-electrophoretic separation. Typ- 
ically, only 400-800 base pairs can be sequenced from a single clone. As a result, the systems are both time- and 
labor-intensive. Methods avoiding gel separation have been developed in attempts to increase the sequencing through- 
put. 

20 [0004] Sequencing by hybridization (SBH) methods have been proposed by Crkvenjakov (Drmanac et al., Genomics 
4:114 (1989); Strezoska et al., (Proc. Natl. Acad. Sci. USA 88:10089 (1991)), Bains and Smith (Bains and Smith, J. 
Theoretical Biol. 135:303 (1988)) and in US-A-5,202,231. This type of system utilizes the information obtained from 
multiple hybridizations of the polynucleotide of interest, using short oligonucleotides to determine the nucleic acid 
sequence. These methods potentially can increase the sequence throughput because multiple hybridization reactions 

25 are performed simultaneously. To reconstruct the sequence, however, an extensive computer search algorithm is re- 
quired to determine the most likely order of all fragments obtained from the multiple hybridizations. 
[0005] The SBH methods are problematic in several respects. For example, the hybridization is dependent upon the 
sequence composition of the duplex of the oligonucleotide and the polynucleotide of interest, so that GC-rich regions 
are more stable than AT-rich regions. As a result, false positives and false negatives during hybridization detection are 

30 frequently present and complicate sequence determination. Furthermore, the sequence of the polynucleotide is not 
determined directly, but is inferred from the sequence of the known probe, which increases the possibility for error. 
[0006] Methods have also been proposed which detect the addition or removal of single molecules from a DNA strand. 
[0007] For example, Hyman E.D., Anal. Biochem., 174:423 (1988) discloses the addition of a nucleotide to an im- 
mobilised DNA template/primer complex in the presence of a polymerase and determination of polymerisation reaction 

35 by detecting the pyrophosphate liberated as a result of the polymerisation. 

[0008] Jett et al., J. Biomol. Struct. Dyn., I, p. 301 , 1989 discloses a method wherein a single stranded DNA or RNA 
molecule of labelled nucleotides, complementary to the sequence to be determined, is suspended in a moving flow 
stream. Individual bases are then cleaved sequentially from the end of the suspended sequence and determined by 
a detector passed by the flow stream. 

40 [0009] EP-A-223 61 8 discloses the use of an immobilised DNA template, primer and polymerase exposed to a flow 
containing only one species of deoxynucleotide at a time. A downstream detection system then determines whether 
deoxynucleotide is incorporated into the copy or not by detecting the difference in deoxynucleotide concentrations 
entering and leaving the flow cell containing the complex of DNA template and polymerase. 

[001 0] WO 90/1 3666 proposes a method directly measuring the growth of the template copy rather than determining 
45 jt indirectly from compositions in the flow medium. Only one of the four nucleotides is present at a time, and the po- 
lymerisation events reflecting the incorporation of a nucleotide or not are detected by spectroscopic means (evanescent 
wave spectroscopy, fluorescence detection, absorption spectroscopy) or by the individual nucleotides being labelled. 
[001 1] Similar methods employing labelled 3*-blocked deoxynucleotides where the blocking group is removable and 
which thus permit sequential deoxynucleotide addition/detection steps are disclosed in WO 91/06678, US-A-5,302,509, 
so DE-A-414 1178 and WO 93/21340. However, the necessary 3-blocking groups are either not described in any detail, 
or are not accepted by the required enzyme, or do not permit desired rapid deblocking of the growing template copy 
strand after each polymerisation event. 

[0012] One object of the present invention is to provide novel nucleotide derivatives which may be used as chain 
terminators and which by deprotection may readily be converted into nucleotides or nucleotide analogues that may be 
55 further extended. 

[0013] Another object of the present invention is to provide a method for nucleotide sequence determination using 
the novel chain terminators. 

[0014] Still another object of the present invention is to provide a method of synthesizing oligo- or polynucleotides 
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by means of the novel chain terminators. 

[0015] WO 94/23064 refers to the nucleotide triphosphates that are substituted at 3* hydroxyl by an ester function. 
It has been shown later, in fact by the same authors "Catalytic editing properties of DNA polymerises", B. Canard et 
al., Proc. Natl. Acad. Sci. USA Vol. 92, pp 10859-10863, that DNA polymerases hydrolyse 3-ester bond completely, 
and without exceptions, liberating the free 3' hydroxyl with simultaneous loss of the fluorophore. In consequence, 
addition of a single base becomes impossible, no fluorescent signal remains bound to the primer, and the whole strategy 
for DNA sequencing is bankrupt. 

[001 6] Another object of the present invention is to provide a process of preparing novel chain terminators according 
to the invention. 

[001 7] In accordance with the invention, these objects are achieved by the provision of a chain terminating nucleotide 
or nucleotide analogue having its 3'-hydroxyl group protected by an acetal or thioacetal structure designed in such a 
way that the 3-hydroxyl can be deprotected in a relatively short time in dilute acid, such as hydrochloric acid at pH 2, 
for example. 

[0018] In one aspect, the present invention therefore provides a compound of the general formula I: 



or a salt thereof, such as a trimethylammonium, ammonium, sodium or potassium salt, wherein 
B is a nucleobase, 

X and Z independently are oxygen or sulphur, 

Y is hydrogen, hydroxy or protected hydroxy, such as methoxy, ethoxy or allyloxy, 

R 1 is hydrocarbyl, which optionally is substituted with a functional group, 

R 2 is hydrogen or hydrocarbyl, which optionally is substituted with a functional group, 

A is an electron withdrawing or electron donating group capable of moderating the acetal stability of the compound 
I via L lf 

and L 2 are hydrocarbon linkers, which may be the same or different, L 2 , when present, being either (i) connected 
to L 1 via the group A, or (ii) directly connected to L v the group A then being bound to one of linkers and L 2 , 
F is a dye label, 

Q is a coupling group for F, and 

I, m and n independently are 0 or 1, with the proviso that I is 1 when m is 1, and I is 1 and m is 1 when n is 1. 
[0019] In Formula I above, the nucleobase B may be natural or synthetic. Natural nucleobases include common 
nucleobases, such as adenine, guanine, cytosine, thymine and uracil, as well as less common nucleobases, such as 
xanthine, hypoxanthine or 2-aminopurine. Synthetic nucleobases B are analogues to the natural nucleobases and 
capable of interacting with other nucleobases in a specific, hydrogen bond determined way. 
[0020] The hydrocarbyl groups represented by and R 2 include a wide variety, including straight and branched 
chain alkyl, alkenyl, aryl, aralkyl and cycloalkyl, preferably containing up to 10 carbon atoms. Preferred hydrocarbyl 
groups are primary, secondary or tertiary alkyl, alkenyl or alkynyl groups, especially lower alkyl groups, such as methyl 
and ethyl. The optional functional group substituents on R 1 and R 2 are capable of moderating the lability of the 3' acetal 
group through an inductive effect. Exemplary of such functional groups are tert. amino, nitro, cyano and halogen (flu- 
oride, chloride, bromide, iodide). 

[0021] Detectable moiety, or label F can be chosen from a vast number of such moieties known to those skilled in 
the art. Exemplary such moities are radioactively labeled functions, luminescent, electroluminescent or fluorescent 
labels, and labels that absorb characteristic visible or infrared light. Preferably, F is a fluorescent label. 
[0022] Coupling group Q, when n=0, is a reactive group to which a label F can be coupled, or, when n=1, is the 
derivatized residue of a reactive group used for coupling linker L 2 to label F. A large number of such coupling groups 
capable of reacting with and binding label F via a reactive function thereon are known to those skilled in the art. Ex- 




I 



F^-Z-C-Lt-CA) - (1*2 ) 1- <Q)nr <*> n 
R 2 



3 



EP 0 808 320 B1 



amples of reactive groups are amino, thio, and carboxyl. In some cases, the group Q is derived completely from the 
reactive function on the uncoupled R For example, if the coupling reaction is a substitution reaction, the group reacting 
with F being, for example, a halogen (fluoride, chloride, bromide or iodide), then the group Q will be represented by 
the reactive function present on label F. 
5 [0023] For certain applications of the compounds of Formula I, as will be described below, no detectable moiety will 
be needed, and the label F, and optionally also the group Q, can then be omitted. 

[0024] Electron withdrawing or donating group A is incorporated into the structure as a moderator of acetal stability. 
The group A may represent a part of the chain L r A-L 2 . Representative groups A in that case are amido, sulfoxy, 
sulfone, carbalkoxy (ester), ether, thioether and amino groups. Alternatively, A may be a side substituent for the L r L 2 

10 chain, representative groups then being, for example, cyano, nitro and halogen (halogen including fluoride, chloride, 
bromide and iodide). In this latter case, linker L 1 will donate structure stretching from the acetal carbon to the place of 
the substitution, and linker L 2 will donate structure stretching from this substitution to the group A. It is to be emphasized 
that the specific electron withdrawing and electron donating groups mentioned above are only examples and that many 
more such groups are known and obvious to those skilled in the art. 

15 [0025] The structure of hydrocarbon linker L 1 will be selected with regard to the function A, inductive effects being 
highly depending on distance. While a straight aliphatic (saturated or unsaturated) hydrocarbon chain is preferred, 
branched or cyclic hydrocarbons may be contemplated. Preferably, linker has up to 1 0 carbon atoms, more preferably 
up to 6 carbon atoms. 

[0026] The function of hydrocarbon linker L 2 is to provide, together with linker L 1f A and coupling group Q, for a 
20 sufficiently long distance between the label moiety F and the rest of the compound of Formula I. This is required by 
spatial preferences of the enzymes (polymerases) for which the compound of Formula I acts as substrate or by the 
necessity of avoiding interaction between the label and nucleobase B. It is readily understood that the structure of 
linker L 2 highly depends on the particular enzyme, label F and possibly also nucleobase B that are used. A suitable 
length and structure of linker L 2 will therefore be selected by the skilled person for each particular situation. Similarly 
25 as for linker , a preferred structure for linker L 2 is a straight aliphatic (saturated or unsaturated) hydrocarbon chain, 
although branched or cyclic hydrocarbons may be contemplated. Preferably, linker L 2 has up to 1 0 carbon atoms, more 
preferably up to 6 carbon atoms. 

[0027] In cases where the label F and the coupling group Q may be omitted, linker L 2 may, of course, also be omitted. 
[0028] The compounds of Formula I can be deprotected to exhibit a free 3'-hydroxy group in a relatively short time 
30 under acidic conditions, e.g. with hydrochloric acid at about pH 2. It is understood that by proper selection of the groups 
R 1t R 2 , A, L 1 and L 2 with regard to each other, the deprotection time may be adjusted to a desired range. Under the 
mentioned acidic conditions, the most preferred compounds of Formula I will be deprotected within, say, 0.01 to 15 
minutes. 

[0029] While the compounds of Formula I may be used as pure chain extension inhibitors, or chain terminators, for 
35 example, in DNA sequencing according to the chain termination method, as is per se known in the art, the advantages 
of the compounds are, of course, better benefited from when the convenient deprotection capabilities of the compounds 
are utilized. This is, for example, the case when the compounds I are used in nucleic acid sequencing methods based 
on the sequential incorporation and determination of individual nucleotides in a growing nucleic acid copy strand as 
described in, for example, the aforementioned WO 91/06678, US-A-5,302,509, DE-A-414 1178 and WO 93/21340. 
40 [0030] Another aspect of the invention therefore provides a method for determining the sequence of a nucleic acid, 
which method comprises providing a single-stranded template comprising the nucleic acid to be determined, and at 
least partially synthesizing a complementary nucleic acid molecule in a stepwise serial manner by the addition of 
nucleotides in which the identity of each nucleotide incorporated into the complementary nucleic acid molecule is 
determined subsequent to its incorporation, wherein said nucleotides are compounds of Formula I as defined above, 
45 and wherein the 3-blocking group is removed from the nucleotide after its incorporation to permit further extension of 
the nucleic acid molecule. 

[0031] In one embodiment, a method for determining the sequence of a nucleic acid comprises the following steps: 

(i) providing a single-stranded template comprising the nucleic acid to be sequenced, 
50 (ij) hybridising a primer to the template to form a template/primer complex, 

(iii) subjecting the primer to an extension reaction by the addition of compounds of Formula I with different nucle- 
obases B corresponding the four bases A, C, T and G or analogues thereof, 

(iv) determining the type of the compound of Formula I added to the primer, 

(v) selectively hydrolysing the acetal protective group, and 

55 (vi) repeating steps (iii) to (v) sequentially and recording the order of incorporation of compounds of Formula I. 

[0032] The different compounds of Formula I in step (iii) may be added in sequence, in which case the four different 
compounds I may carry the same label F. Alternatively, the different compounds I have different labels F and are added 
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at the same time. 

[0033] In a preferred embodiment of the method of the invention, the template/primer complex is bound to a solid- 
phase support, such as a sequencing chip, for example. The template may be attached to the solid support via a binding 
linker, which, for instance, is ligated to the 5'-end of the template or incorporated in one of the ends of the template by 
5 polymerase chain reaction (PCR). The binding linker may then be attached to the solid support by use of a streptavidin 
coupling system. Alternatively, the primer may be attached to the solid support. 

[0034] The compounds of Formula I may, of course, also conveniently be used in so-called mini-sequencing (see e. 
g. SyvSnen A-C et al., Genomics 8:684-692 (1990). 

[0035] As is readily understood by a person skilled in the art, the compounds of Formula I may also be used in the 
10 synthesis of nucleotide chains, and another aspect of the invention relates to such use. For example, oligonucleotides 
and polynucleotides may be prepared by successively coupling compounds of Formula I to each other in any desired 
base order with intervening deblocking and using a non-template dependent polymerase, such as a terminal trans- 
ferase. For such synthesis, the groups L 2 , Q and F in Formula I may, of course, be omitted. 
[0036] The compounds of Formula I may be prepared by methods known per se. In a further aspect, however, the 
15 invention provides a particular method for the preparation of a subgroup of compounds of Formula I by direct 3'-OH 
protection, and more particularly compounds of Formula la: 
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or a salt thereof, wherein B, X, Z, R,, R 2 , Q. F, m and n are as defined above, and p and q independently are integers 
from 1 to 10, preferably from 1 to 6, by reacting a compound of Formula II: 
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or a salt thereof, wherein B and X are as defined above, with a compound of Formula I 
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wherein R 1# R 2 , Z and p are as defined above, and R 3 is hydrocarbyl (for example, as defined for and R 2 above), 
to produce a compound of Formula IV: 
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10 

or a salt thereof, wherein B, X, Z, R h R 2 , R 3 and p are as defined above, optionally reacting the latter with a diamine 
H 2 N-(CH2)q-NH 2 , wherein q is as defined above, to produce a compound of Formula V: 



J5 



20 



o 0 x 

II II II p 

HO-P-O-P-O-P-O-i^O B 

OH OH OH \ 7 



V ZR i v 

(CHj) p 
o ,C.^CH 2 , q -NH 2 



wherein B ? X, Z, R 1 , R 2 , p and q are as defined above, and optionally coupling a dye label F to the terminal amino group. 
[0037] In the following, the invention will be illustrated by some non-limiting examples. Reference will be made to 
30 the accompanying drawings, wherein: 

Fig. 1 is a reaction scheme for the enolether synthesis described in Example 1 below. Intermediate and final 
products corresponding to various values of integer "n" in the respective structural formulae are identified by num- 
bers indicated within brackets after the n-vaiues listed under the respective formulae. 

35 Fig. 2 is a reaction scheme for the preparation of 3'-acetal-modified thymidine described in Example 2 below. 

Prepared final compounds corresponding to various values of integer "n" in the respective structural formula are 
identified by numbers indicated within brackets after the n-values listed under the formula. 
Fig. 3 is a graph showing thymidine 3-acetal hydrolysis at pH 4 as log % of remaining acetal versus time in minutes. 
Fig. 4 is a reaction scheme for the one-pot synthesis of 3-acetal protected deoxynucleotide triphosphates described 

40 in Example 4 below. Intermediate and final products corresponding to various values of integer "n" in the respective 

structural formulae are identified by numbers indicated within brackets after the n-values listed under the respective 
formulae. 

EXAMPLE 1 

45 

Synthesis of enolethers for derlvatlzation of nucleotide triphosphates (Fig. 1) 

Step 1 . Esterification of ketoacids 

50 [0038] The appropriate ketoacid (1 eq) was added at once to a large excess of dry methanol (20 eq), previously 
treated with thionyl chloride (0.1 eq). The homogenous mixture was refluxed overnight, evaporated under reduced 
pressure, and partitioned between saturated sodium hydrogen carbonate and dichloromethane. The combined organic 
extracts were dried with magnesium sulphate, evaporated and the residue was distilled under reduced pressure to 
give the appropriate methyl ester in a high yield (80% to 95%). 

55 [0039] The following ketoacids were used: 

1) 4-oxy pentanoic acid (levulinic acid) (Aldrich) 

2) 5-oxy hexanoic acid has not been isolated. Instead a commercial ethyl ester-derivative of this acid (Merck) was 
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transesterified in a reaction analogous to the above general procedure. 

3) 6-oxy heptanoic acid was obtained from 2-methyl cyclohexanol (Merck), according to a published procedure 
(Org. Synth. 31: 3-5, (1951)). 

4) 7-oxy octanoic acid was obtained from 2-acetyl cyclohexanone as described (J. Am. Chem. Soc. 70:4023-4026 
5 (1948)). 

Step 2. Synthesis of the dimethoxyacetal derivatives (compounds 1-4 in Fig. 1) 

[0040] Benzenesutfonic acid (0.01 eq) was added to each of above methyl esters (1 eq) dissolved in methanol (2 
10 eq) and trimethylorthoformate (3 eq). The dark brown mixture was refluxed for 3 h, neutralized by addition of dry 
triethylamine (0.1 eq), and evaporated. The residue was distilled under lowered pressure to yield a pure acetal. 
Compound 1. The physical and NMR data for this acetal correspond well to the data reported previously. 
Compound 2. Yield 85%, bp. 120° (15 mm Hg) 1H NMR (CDCI 3 ): 1.28 (s, 3 H), 1.60-1.69 (m, 4 H), 2.34 (t, 2 H), 3.17 
(s, 6 H), 3.67 (s, 3 H). 

15 Compound 3. Yield 92%, bp. 130-133° (15 mm Hg) 1 H NMR (CDCI 3 ): 1 .25 (s, 3 H), 1.29-1.35 (m, 2 H), 1 .60-1 .70 (m, 
4 H), 2.33 (t, 2 H), 3.17 (s, 6 H), 3.67 (s, 3 H). 

Compound* Yield 87%, bp. 147-149° (15 mm Hg) 1 H NMR (CDCI 3 ): 1.25 (s, 3 H), 1.27-1.38 (m, 4 H), 1.57-1 .67 (m, 
4 H), 2.31 (t, 2 H), 3.16 (s, 6 H), 3.66 (s, 3 H). 

20 Step 3. Synthesis of enolethers (compounds 5-8 in Fig. 1) . 

[0041] A mixture of the appropriate acetal (1 eq) and trimethylortoformate (0.5 eq) was placed in a distillation flask, 
equipped with a 20 cm Vigreux column. Benzenesulfonic acid (0.02 eq) was added and the mixture was refluxed. The 
heating was regulated so that the liberated methanol evaporated slowly. After 4 h the heating was increased and the 

25 dark mixture was fractionated under reduced pressure without previous neutralization of the acidic catalyst. The yields 
of isolated colorless enolethers were in all cases very high (85-95%) but GC and NMR analyses showed the presence 
of starting acetal in proportions from 10 to 30 %. Since these impurities were not expected to influence the derivatization 
of nucleotides no attempts were made to purify them further. The complete assignment of NMR signals was difficult 
because of the presence of starting material and the fact that these asymmetric enolethers exist as several isomeric 

30 forms. Nevertheless, in all spectra a vinylic signal of CH 2 from one isomer at around 4.4 to 4.5 ppm and a vinylic signal 
of CH from the other isomer at 3.85 ppm could be easily observed. 

EXAMPLE 2 

35 Synthesis of 3 1 acetal-modlfied thymidine (Fig. 2) 

[0042] The 5-protected thymidine, 5'-FMOC T (FMOC-fluorenyl methoxycarbonyl), (116 mg, 0.25 mmol) was dried 
by coevaporation with dry acetonitrile (10 ml) and dissolved in dry dioxane (5 ml). To this magnetically stirred solution 
an appropriate enolether (0.5 ml, « 10 eq) prepared in Example 1 was added, followed by trifluoroacetic acid (20 ml, 

40 1 eq). After 45 min, the TLC analysis (Silicagel 60 F254, 10% methanol in chloroform) showed complete consumption 
of the starting material, and dry triethylamine (2 ml) was introduced in order to remove the base-labile FMOC-group. 
After this process was completed (60 min), the whole mixture was evaporated under reduced pressure. The nucleoside 
was separated from the excess of enolether by precipitation from petroleum ether, and the precipitate, dissolved in 5 
ml of dry methanol, was treated with 1 ,3-diaminopropane (2 ml, 100 eq) at 60° for 180 min in order to effect the ami- 

45 nolysis of the ester function on the acetal moiety. Finally, the reaction mixture was evaporated under low pressure (oil 
pump) and the crude material was flash chromatographed on a silica gel column, equilibrated in ethanoi and using a 
step gradient of cone, ammonium hydroxide (0-10%) in ethanoi. The material, homogeneous by TLC (developed in 
1-butanol-ammonium hydroxide 8:2), was combined, evaporated, and coevaporated with toluene, to give the NMR 
pure product (compounds 9-12 in Fig. 2) in high yield (72 to 85%). The pure material was stored as a stock solution in 

50 methanol after addition of three drops of ammonia. 

EXEMPLE 3 

Acidic hydrolysis (pH 4) of thymidine, substituted at the 3'-position with different acetyl groups 

55 

[0043] A reference acetate buffer pH 4.0, prepared by mixing of solutions of sodium acetate (0.20 M, 18.0 ml) with 
acetic acid (0.20 M, 82.0 ml) was used in all hydrolysis studies. To this buffer (10.0 ml) an ethanolic solution of the 
appropriate thymidine 3-acetal derivative prepared in Example 2 (compounds 9-12 in Fig. 2) (100 ml) was added with 
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gentle stirring. At different time points a sample of 0.5 ml was withdrawn and placed in a tube containing 1 5 ml of cone, 
ammonia to raise the pH to 9. Samples were analyzed by FPLC using an anion exchange column (Mono Q-Pharmacia 
Biotech AB) and a gradient system of tetraethyiammonium bicarbonate pH 8.2 for elution (0.05 to 0.75 M). The peak 
areas of starting material and its hydrolysis product thymidine were integrated, and plotted as shown in Fig. 3. 

5 

EXAMPLE 4 

One pot-synthesis of deoxynucleotide triphosphates, protected at their 3'-OH position by a functionalized 
acetal group (Fig. 4) 

10 

[0044] A commercial deoxynucleotide triphosphate (pppdT, pppdC, pppdG, or pppdA) was chromatographed on a 
preparative Mono Q column using a gradient system of tetraethyiammonium bicarbonate pH 8.2 (0.05 to 1 .3 M) to 
obtain the pure triphosphate in the form of a triethylammonium salt. This material was evaporated on a Rotavapor, 
coevaporated with dry acetonitrile (3x2 ml), and dissolved in molecular sieve-dried trimethylphosphate (0.5 ml). The 

15 appropriate enolether (compounds 5-8 in Fig. 1 ) (0.2 ml) and trifluoroacetic acid (1 0 eq as a 1 0% solution in dry dioxane) 
were added. The homogeneous mixture was incubated at 20°C for 60 min, neutralized by the addition of triethylamine 
(100 ml), and precipitated from a 1:1 mixture of petroleum ether and diethyl ether. The oily precipitate was dissolved 
in methanol (2 ml) and 1 ,3-diaminopropane (0.5 ml) or another diamine (1 ,4-diaminobutane, 1 ,6-diaminohexane) (0.5 
ml) was added. The ester function was subjected to aminolysis overnight at 60°C. The mixture was again precipitated 

20 from 1:1 petroleum ether and diethyl ether, washed with diethyl ether, and dissolved in water. The water solution was 
analyzed and preparatively purified on the described anion exchange column. The well resolved products (compounds 
13-28 in Fig. 4) always appear prior to the original deoxynucleotide triphosphate, having a retention time comparable 
to that of the appropriate deoxynucleotide diphosphate (as was found from separate coinjection experiments). An 
aliquot of the isolated 3-acetal-modified derivative was evaporated, treated with 80% acetic acid for 2 min, and, after 

25 evaporation of acid, reinjected to the same chromatographic system. In all cases all starting material was reacted and 
a single product with higher retention time was formed that corresponded to the original deoxynucleotide triphosphate. 
Thymidine triphosphate reacts to form, besides the desired ^-modified derivative, also another product with even 
shorter retention time but that also hydrolyses to the starting triphosphate during acid treatment. It is assumed that this 
product is the bis-3'-0, 4-0-acetal-derivative of thymidine triphosphate. This type of side products was not present in 

30 reactions that used other nucleotide triphosphates. It should atso be stressed that very little products of depurination 
were formed in reactions in which pppdG and pppdA were used. This can be explained by the mild acid applied as a 
catalyst, the large excess of enolether used in the reaction, and the fact that the bases existed in an unprotected (more 
acid resistant) form. 

35 EXAMPLE 5 

Coupling of a fluorophore to the 3'-functionalised deoxynucleotide triphosphate (Fig. 4) 

[0045] This derivatization can be performed with a variety of reactive fluorophores that may differ in their reactivity 
40 and in their optimal reaction conditions. Here is described the procedure for labelling of 3'-functionalized deoxynucle- 
otide triphosphate with fluorescein isothiocyanate. 

[0046] The appropriate nucleotide triphosphate, possessing an amino group attached to the 3 - position of the sugar 
residue via an acetal function (compounds 13-28 in Fig. 4), was evaporated and dissolved in 0.1 M carbonate buffer 
pH 10 (0.5 ml). Fluorescein isothiocyanate (single isomer) (10 eq), dissolved in dimethylformamide (0.25 ml), was 
45 added and the mixture was incubated overnight at 20°C. The reaction mixture was applied on a prototype FPLC Su- 
perdex® gel filtration column (Pharmacia Biotech AB) (equivalent to Sephadex® G-10; Pharmacia Biotech AB), equil- 
ibrated and run in TEA HC0 3 buffer (0.1 M). The fluorescent material appearing in the void was collected (compounds 
29-44 in Fig. 4). As expected, the original deoxynucleotide triphosphate was formed upon action of acid on the fluo- 
rescein-labelled nucleotide. 

50 

EXAMPLE 6 

Stepwise enzymatical incorporation of 3'-modified triphosphates 

55 [0047] Four oligonucleotides were prepared, having sequences corresponding to the M13 sequencing primer, and 
designed to incorporate to their 3-ends T, G, A or C base respectively : 
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(T) 
(G) 
(A) 
(C) 



5 



CGACGTTGTAAAACGACGGCCAG 



(23 
(22 
(21 
(20 



mer) 
mer) 
mer) 
mer) 



5 



CGACGTTGTAAAACGACGGCCA 
CGACGTTGTAAAACGACGGCC 



CGACGTTGTAAAACGACGGC 



10 



15 



20 



25 



30 



35 



40 



45 



50 



[0048] Each of them was radioactively labelled with 32 P using T4 polynucleotidekinase. 

[0049] In order to investigate if the modified nucleotide triphosphates are accepted effectively and specifically by the 
T7 polymerase, each of the labelled primers was subjected to the single step extension conditions. These reaction 
mixtures (20 ml) were composed of the respective primer (0.1 pmol), M13 template (1 pmol), and 3'-modified thymidine 
triphosphate (compound 1 6 in Fig. 4) in a buffer system pH 7.5 containing Tris/HCI (40 mM), MnCI 2 (4 mM), dithiothreitol 
(DTT) (11 mM), sodium isocitrate (29 mM) and sodium chloride (23 mM). The mixtures were denatured at 70°C, cooled, 
and incubated at42°C after the addition of T7 DNA polymerase (1 0 U)for 30 min. All reactions were stopped by addition 
of formamide (20 ml), denatured at 70°C, cooled on ice, and separated on a 6% denaturing sequencing gel. The 
developed autoradiogram showed that one-base extension had taken place solely in the reaction containing 23-mer- 
T (designed to prime the M13 template and incorporate thymidylic acid at its 3'-end). None of the other reactions 
containing primers designed for incorporation of dG, dA and dC, respectively, showed any sign of primer extension. 

EXAMPLE 7 

Sequencing reaction with modified chain terminator 

[0050] All sequencing reactions were performed according to the standard AB1® protocol (Taq Dye Primer Cycle 
Sequencing Kit - protocol 901482). 

[0051] The sequencing reactions consisted of the following components: M13 mp 18 ss-DNA template (1 pmol), 5- 
fiuorescein labelled universal primer (1 pmol, Pharmacia Biotech AB), Ampli Taq DNA polymerase (4U) and ^termi- 
nation mixture in cycle sequencing buffer (Tris-HCI pH 8.9, 80 mM, ammonium sulfate 20 mM, magnesium chloride 5 
mM). Instead of the standard ddT triphosphate, compound 16 in Fig. 4, in different proportions with TTP, was used as 
a chain terminator. After the cycling was completed, an equal volume of formamide was added and the mixture was 
heated for 2 min at 90°C, cooled on ice and loaded onto 6% acrylamide urea gel. Separation and analysis of products 
were performed on an A.L.F. DNA-sequencer (Pharmacia Biotech AB). 

[0052] As a result, a pattern of fluorescent-labelled sequences was obtained. This electrophoretic picture was specific 
for the applied terminator. 



Claims 

1 . A compound of the general structure I: 




R,-Z-C-L,-{A) - (1,2 >i- (Q) m - (F) n 
R 2 



or a salt thereof, wherein 
B is a nucleobase, 

X and Z independently are oxygen or sulphur, 
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Y is hydrogen or hydroxy, which optionally may be protected, 

R 1 is straight and branched chain alkyl, alkenyl, aryl, aralkyl and cycloalkyl, preferably containing up to 10 
carbon atoms, which optionally is substituted with a functional group, 

R 2 is hydrogen or straight and branched chain alkyl, alkenyl, aryl, aralkyl and cycloalkyl, preferably containing 
5 up to 10 carbon atoms, which optionally is substituted with a functional group, 

A is (i) an amido, sulfoxy, sulfone, carbalkoxy, ether, thioether, or amino, when the group A represents a part 
of the chain L r A-L 2 ; (ii) cyano, nitro and halogen, when the group A is a side substituent for the L^-l^ chain. 

L 1 and L 2 are chosen from straight and branched chain alkyl, alkenyl, aryl, aralkyl and cycloalkyl, preferably 
containing up to 1 0 carbon atoms, which may be the same or different, L 2 , when present, being either (i) connected 
10 to L 1 via the group A, or (ii) directly connected to L v the group A then being connected to one of linkers and L 2 , 

F is radioactively labeled functions, luminescent, electroluminescent or fluorescent labels, and labels that 
absorb characteristic visible or infrared light, 

Q is a coupling group for F, and when n=0, Q is a reactive group to which a label F can be coupled, or, when 
n=1 , Q is the derivatized residue of a reactive group used for coupling linker L 2 to label F, such as amino, thio, and 
15 carboxyl or a function reactive in a substitution reaction and present on F and 

I, m and n independently are 0 or 1 , with the proviso that I is 1 when m is 1 , and I is 1 and m is 1 when n is 1 . 

2. The compound according to claim 1, wherein R 1 and R 2 are independently selected from primary, secondary and 
tertiary alkyl, containing up to 10 carbon atoms, optionally substituted with one or more functional groups. 

20 

3. The compound according to claim 2, wherein and R 2 are selected from methyl and ethyl. 

4. The compound according to claim 1 , 2 or 3, wherein A is a part of a chain L r A-L 2 and is selected from amido, 
sulfoxy, sulfone, carbalkoxy, ether, thioether, and amino. 

25 

5. The compound according to claim 1 , 2 or 3, wherein A is a substituent to a chain L r L 2 and is selected from cyano, 
nitro and halogen. 

6. The compound according to any one of claims 1 to 5, wherein is a straight alkyl group, preferably having up to 
30 10 carbon atoms. 



7. The compound according to any one of claims 1 to 6, wherein L 2 is a straight alkyl group, preferably having up to 
10 carbon atoms. 

35 8. The compound according to any one of claims 1 to 7, wherein Y is methoxy, ethoxy or allyloxy. 



9. Use of compounds according to any one of claims 1 to 8 in nucleic acid synthesis reactions, especially for preparing 
oligo- or polynucleotides. 

40 10. Use of compounds according to any one of claims 1 to 8 in minisequencing. 

11 . A method for determining the sequence of a nucleic acid, comprising providing a single-stranded template com- 
prising the nucleic acid to be determined, and at least partially synthesizing a complementary nucleic acid molecule 
in a stepwise serial manner by the addition of nucleotides in which the identity of each nucleotide incorporated 

45 into the complementary nucleic acid molecule is determined subsequent to its incorporation, wherein said nucle- 

otides are compounds of Formula I as defined in any one of claims 1 to 8, and wherein the 3-blocking group is 
removed from the nucleotide after its incorporation to permit further extension of the nucleic acid molecule. 

12. The method according to claim 11 , wherein the 3'-blocking group is removed by acid hydrolysis. 

50 

13. The method according to claim 11 or 12, wherein F in Formula I is a fluorescent label. 

14. The method according to any one of claims 11 to 13, wherein said single-stranded template is bound to a solid 
support. 

55 

15. A process of preparing a compound of Formula la: 
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O 0 * 

II w w _ p 

HO -P-O-P-O-P-O-yOtl 

OH OH OH \ / 

(CH»)p 



< C SN ^CH 2 ) q -(Q) B -(P)n 
0 H 



or a salt thereof, wherein B, X, Z, R,, R 2 , Q, F, m and n are as defined in claim 1, and 
p and q independently are integers from 1 to 10, by reacting a compound of Formula II: 



0 0 x 

8 - !! _ ii 



HO-P-0-P-O-P-O-.o B u 
OH OH OH \ 7 



OH 



or a salt thereof, wherein B and X are as defined above, with a compound of Formula III: 



R 3 -C»CH- {CH 2 ) p . 1 -COOR 3 1IZ 



wherein R,, R 2 , Z and p are as defined above, and R 3 is straight and branched chain alkyl, alkenyl, aryl, aralkyl 
and cycloalkyl, preferably containing up to 10 carbon atoms, to produce a compound of Formula IV: 



0 0* 

ii n it e» 
HO-P-O-P-O-P-O— • O B 

OH OH OH 



IV 

R 2 ' N 

COO A3 



or a salt thereof, wherein B, X, Z, R, , R 2 , R 3 and p are as defined above, optionally reacting the latter with a diamine 
H 2 N-(CH 2 ) q -NH 2 , wherein q is as defined above, to produce a compound of Formula V: 
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^ N ^CH 2 ) q -NH 2 



wherein B, X, Z, R,, R 2 , p and q are as defined above, and optionally coupling a dye label F to the terminal amino 
group. 

Patentanspruche 

1 . Verbindung der allgemeinen Forme! I: 



Oder ein Salz davon, worin 
B eine Nukleobase ist, 

X und Z unabhangig Sauerstoff oder Schwefel sind, 

Y Wasserstoff oder Hydroxy ist, das gegebenenfalls geschutzt sein kann, 

R 1 lineares und verzweigtes Alkyl, Alkenyl, Aryl, Aralkyl und Cycloalkyl mit vorzugsweise bis zu 10 Kohlenstoffa- 
tomen ist t welches gegebenenfalls mit einer funktionellen Gruppe substituiert ist, 

R 2 Wasserstoff oder lineares und verzweigtes Alkyl, Alkenyl, Aryl, Aralkyl und Cycloalkyl mit vorzugsweise bis zu 
10 Kohlenstoffatomen ist, welches gegebenenfalls mit einer funktionellen Gruppe substituiert ist, 
A (i) Amido, Sulfoxy, Sulfon, Carbalkoxy, Ether, Thioether oder Amino ist, wenn die Gruppe A einen Teil der Kette 
L r A-L 2 darstellt; (ii) Cyan, Nitro und Halogen ist, wenn die Gruppe A ein Seitensubstituent der L,-L 2 -Kette ist, 

und L 2 , diegleich oder verschieden sein konnen, ausgewShlt sind aus linearem und verzweigtem Alkyl, Alkenyl, 
Aryl, Aralkyl und Cycloalkyl mit vorzugsweise bis zu 10 Kohlenstoffatomen, wobei L 2 , wenn vorhanden, entweder 
(i) mit l^ uber die Gruppe A verknupft ist oder (ii) direkt mit verknOpfl ist, wobei dann die Gruppe A mit einem 
der Linker und L 2 verknupft ist, 

F fur radioaktiv markierte Funktionen, Lumineszenz-, Elektrolumineszenz- oder Fluoreszenzmarker und Marker 
steht, die charakteristisches sichtbares Licht oder Infrarot-Licht absorbieren, 

Q eine Kupplungsgruppe fur F ist; und wenn n = 0, Q eine reaktive Gruppe ist, an die ein Marker F gekuppelt 
werden kann , oder, wenn n = 1 , Q der derivatisierte Rest einer zur Kupplung von Linker L 2 an Marker F eingesetzten 
reaktiven Gruppe ist, beispielsweise Amino, Thio und Carboxy oder eine Funktion, die bei einer Substitutionsre- 
aktion reagiert und an F vorliegt, und 

I, m und n unabhangig 0 oder 1 sind, mit der EinschrSnkung, dad 1 1 ist, wenn m 1 ist, und 1 1 ist und m 1 ist, wenn 
n 1 ist. 




Rj-Z-C-MA) - (X* > 1- (Q>ar <*> n 
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2. Verbindung nach Anspruch 1, wobei R 1 und R 2 unabhangig ausgewdhlt sind aus prim^rem, sekundaYem und 
tertiarem Alkyl mit bis zu 1 0 Kohlenstoffatomen, welches gegebenenfalls mit ein oder mehreren funktionellen Grup- 
pen substituiert ist. 

5 3. Verbindung nach Anspruch 2, wobei und R 2 aus Methyl und Ethyl ausgewShlt sind. 

4. Verbindung nach Anspruch 1, 2 oder 3, wobei A ein Teil einer L r A-L 2 -Kette ist und ausgewShlt ist aus Amido, 
Sulfoxy, Sulfon, Carbafkoxy, Ether, Thioether und Amino. 

10 5. Verbindung nach Anspruch 1 , 2 oder 3, wobei A ein Substituent einer L r L 2 -Kette ist und ausgewShlt ist aus Cyan, 
Nitro und Halogen. 

6. Verbindung nach irgendeinem der Anspruche 1 bis 5, wobei eine lineare Alkylgruppe mit vorzugsweise bis zu 
10 Kohlenstoffatomen ist. 

15 

7. Verbindung nach irgendeinem der Anspruche 1 bis 6, wobei L 2 eine lineare Alkylgruppe mit vorzugsweise bis zu 
10 Kohlenstoffatomen ist. 

8. Verbindung nach irgendeinem der Anspruche 1 bis 7, wobei Y Methoxy, Ethoxy oder Allyloxy ist. 

20 

9. Verwendung von Verbindungen nach irgendeinem der Anspruche 1 bis 8 bei Reaktionen zur NukleinsSuresynthe- 
se, insbesondere zur Herstellung von Oligo- oder Polynukleotiden. 

10. Verwendung von Verbindungen nach irgendeinem der Anspruche 1 bis 8 zur Minisequenzierung. 

25 

11. Methode zur Bestimmung der Sequenz einer Nukleinsaure, umfassend das Bereitstellen eines einzelstrangigen 
Templats, das die zu bestimmende Nukleinsaure umfafct, und wenigstens teilweises Synthetisieren eines komple- 
mentaren Nukleinsauremolekuls auf schrittweise sequenzielle Art durch die Zugabe von Nukleotiden, wobei die 
Identitat eines jeden in das komplementare Nukleinsauremolekul eingebauten Nukleotids nach dessen Einbau 

30 bestimmt wird, wobei die Nukleotide Verbindungen der Formel I sind, wie in irgendeinem der Anspruche 1 bis 8 

definiert, und wobei die 3-Schutzgruppe von dem Nukleotid nach dessen Einbau abgespalten wird, urn die Wei- 
terveriangerung des Nukleinsauremolekuls zu ermdglichen. 

12. Methode nach Anspruch 11, wobei die 3'-Schutzgruppe durch saure Hydrolyse abgespalten wird. 

35 

13. Methode nach Anspruch 11 oder 12, wobei F in Formel I ein Fluoreszenzmarker ist. 

14. Methode nach irgendeinem der Anspruche 11 bis 13, wobei das einzelstrSngige Templat an einen festen Trager 
gebunden ist. 

40 

15. Verfahren zur Herstellung einer Verbindung der Formel la: 



45 



50 



55 



o o * 
HO -p^P^-P-^-j^ B 
OH OH OH \ / 

V* 1 



oder einem Salz davon, worin B, X, Z, R 1 , R 2 , Q, F, m und n wie in Anspruch 1 definiert sind und p und q unabhangig 
ganze Zahlen von 1 bis 10 sind, indem eine Verbindung der Formel II: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



fi If It « 

HO-p-o-p-o-p-o-T/OB ri 

OH OH OH \ J 



OH 

Oder ein Salz davon, worin B und X wie oben definiert sind, mit einer Verbindung der Formel III: 



R 2 -C*CH~ (CH 2 )p-l-COOR3 III 



worin R 1f R 2 , Z und p wie oben definiert sind und R 3 lineares und verzweigtes Alkyl, Alkenyl, Aryl, Aralkyl und 
Cycloalkyl mit vorzugsweise bis zu 10 Kohlenstoffatomen ist, zu einer Verbindung der Formel IV: 



0 0* 
HO-P-O-P-O-P-O— * O. B 



o 

II 

■? 

OH OH OH 



^000113 



Oder einem Salz davon umgesetzt wird, worin B, X, Z, R,, R 2 , R 3 und p wie oben definiert sind, letzteres gegebe- 
nenfalls mit einem Diamin H 2 N-(CH 2 ) q -NH 2 , worin q wie oben definiert ist, zu einer Verbindung der Formel V: 



OH OH TyJ 

2'V 



R2' C \ 

(CH^ p 



umgesetzt wird, worin B, X, Z, R 1t R 2 , p und q wie oben definiert sind, und gegebenenfalls ein Farbmarker F an 
die terminale Aminogruppe gekuppelt wird. 



Revendications 

1 . Compose de structure generate I 
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if if If 
HO-P-O-f-O-f-0 

OH OH OH 



r i 

9 v 




ou sel de celui-ci, dans laquelle 
B est une nucleobase, 

X et Z sont independamment oxygene ou soufre, 
Y est hydrogene ou hydroxy, Eventuellement protege, 

R 1 est alkyle, alcenyle, aryle, aralkyle et cycloalkyle a chaine droite ou ramifiee contenant, de preference, 
jusqu'a 10 atomes de carbone, qui peut eventuellement Stre substitue par un groupe fonctionnel, 

R 2 est hydrogene ou alkyle, alcenyle, aryle, aralkyle et cycloalkyle a chaine droite ou ramifiee contenant, de 
preference, jusqu'a 10 atomes de carbone, qui peut eventuellement etre substitue par un groupe fonctionnel, 

A est (i) amido, sulfoxy, sulfone, carbalcoxy, ether, thioether, ou amino, quand le groupe A represente une 
partie de la chaine L r A-L 2 ; (ii) cyano, nitro et halogene, quand le groupe A est un substituant lateral pour la 
chaine L r L 2 . 

L 1 et L 2 , qui peuvent etre identiques ou differents, sont choisis parmi alkyle, alcenyle, aryle, aralkyle et cy- 
cloalkyle a chaTne droite ou ramifiee contenant, de preference, jusqu'a 10 atomes de carbone, L 2 , lorsque present, 
6tant soit (i) lie a L 1 via le groupe A, soit (ii) lie directement a L v le groupe A etant alors lie a I'un des elements de 
liaison L 1 et L 2 , 

F correspond a des fonctions marquees radioactivement, des marqueurs luminescents, electroluminescents 
ou fluorescents, et des marqueurs qui absorbent une lumiere visible ou infrarouge caracteristique, 

Q est un groupe de couplage pour F, et quand n=0, Q est un groupe reactif auquel un marqueur F peut §tre 
couple, ou, quand n=1, Q est le residu derive d'un groupe reactif utilise pour coupler I'element de liaison L 2 au 
marqueur F, tel que amino, thio, etcarboxyle ou une fonction reactive dans une reaction de substitution et presente 
sur F et 

I, m et n sont, independamment, 0 ou 1, a condition que I soit 1 quand m est 1, et que I soit 1 et m soit 1 
quand n est 1. 

Compose selon la revendication 1, dans lequel et R 2 sont independamment choisis parmi un alkyle primaire, 
secondare et tertiaire, contenant jusqu'a 10 atomes de carbone, eventuellement substitue par un ou plusieurs 
groupes fonctionnels. 

Compose selon la revendication 2, dans lequel R } et R 2 sont choisis parmi methyle et ethyle. 

Compose selon la revendication 1 , 2 ou 3, dans lequel A fait partie d'une chaTne L r A-L 2 et est choisi parmi amido, 
sulfoxy, sulfone, carbalcoxy, ether, thioether, et amino. 

Compose selon la revendication 1, 2 ou 3, dans lequel A est un substituant sur une chaTne L r L 2 et est choisi 
parmi cyano, nitro et halogene. 

Compose selon Tune quelconque des revendications 1 a 5, dans lequel L 1 est un groupe alkyle a chaTne droite 
ayant, de preference, jusqu'a 10 atomes de carbone. 

Compose selon Tune quelconque des revendications 1 a 6, dans lequel L 2 est un groupe alkyle a chaTne droite 
ayant, de preference, jusqu'a 10 atomes de carbone. 

Compos6 selon I'une quelconque des revendications 1 a 7, dans lequel Y est methoxy, ethoxy ou allyloxy. 
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9. Utilisation des composes selon I'une quelconque des revendications 1 a 8 dans des reactions de synthase d'acides 
nucleiques, notamment pour preparer des oligo- ou des poly-nucleotides. 

10. Utilisation des composes selon Tune quelconque des revendications 1 a 8 dans le mini-sequencage. 

5 

1 1 . Procede de determination de la sequence d'un acide nucleique, comprenant les etapes consistant a preparer une 
matrice monocatenaire comprenant I'acide nucleique qui doit etre determine, et a synthetiser au moins partieile- 
ment une molecule d'acide nucleique complementaire d'une maniere sequentielle par I'addition de nucleotides 
successifs, I'identite de chaque nucleotide incorpore dans la molecule d'acide nucleique complementaire etant 

10 determinee apres son incorporation, dans lequel lesdits nucleotides sont des composes de Formule I tels que 

definis dans Tune quelconque des revendications 1 a 8, et dans lequel le groupe de blocage en 3' est supprime 
du nucleotide apres son incorporation pour permettre la poursuite de I'allongement de la molecule d'acide nuclei- 
que. 

15 12. Procede selon la revendication 11, dans lequel le groupe de blocage en 3' est supprime par hydrolyse acide. 

13. Procede selon la revendication 11 ou 12, dans lequel F dans la Formule I est un marqueur fluorescent. 

14. Procede selon I'une quelconque des revendications 11 a 13, dans lequel ladite matrice monocatenaire est liee a 
20 un support solide. 

15. Procede de preparation d'un compose de Formule la : 



30 



35 




ou d'un sel de celui-ci, dans laquelle B, X, Z, R 1p R 2 , Q, F, m et n sont tels que definis dans la revendication 
^0 1, et p et q sont, independamment, des nombres entiers de 1 a 10, qui consiste a faire reagir un compose de 

Formule II : 



45 




OH 

ou un sel de celui-ci, dans laquelle B etX sont tels que definis ci-dessus, avec un compose de Formule III : 
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dans laquelle R 1t R 2 , Z et p sont tels que definis ci-dessus, et R 3 est un alkyle, alcenyle, aryle, aralkyle et 
cycloalkyle a chame droite ou ramifiee contenant, de preference, jusqu'a 10 atomes de carbone, pour produire un 
compose de Formule IV : 



OH OH OH 



X3 



IV 



ou un sel de celui-ci, dans laquelle B, X, Z, R 1f R 2 , R 3 et p sont tels que definis ci-dessus, a faire eventuel- 
lement reagir ce dernier avec une diamine H 2 N-(CH 2 )q-NH 2 , q etant tel que defini ci-dessus, pour produire un 
compose de Formule V: 



dans laquelle B, X, Z, R 1f R 2 , p et q sont tels que definis ci-dessus, et eventuellement le coupiage avec un 
marqueur colorant F du groupe amino-terrninal. 
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